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Abstract — The partition function of a factor graph can some- 
times be accurately estimated by Monte Carlo methods. In this 
paper, such methods are extended to factor graphs with negative 
and complex factors. 

I. Introduction 

Let X\ , X% , . . . , Xm be finite sets, let X be the Cartesian 
product X = X\ x Xi x . . . x Xn, and let / be a function 
/ : X — > C. We are interested in numerically computing the 
partition function 

x£X 

for cases where N is large and / has some useful factorization. 

If / has a cycle-free factor graph (TJ, the sum ([T]i can be 
computed directly by sum-product message passing. In this 
paper, however, we are interested in cases where no such 
factor graph is available (e.g., when the factor graph of / 
is a rectangular grid as in Fig. [7J. 

In the important special case where / is real-valued and 
nonnegative (i.e., f(x) > for all x £ X), the probability 
mass function 

/ \ A f{x) 

Pf( x ) = -s— 
6f 



The paper is structured as follows. After introducing some 



(2) 



is the basis of a variety of Monte Carlo methods for esti- 
mating ([T]), see Q, J3J. Such Monte Carlo algorithms have 
successfully been used, e.g., for the computation of informa- 
tion rates of source/channel models with 2-D memory (4), 
Q. Note that p inherits factorizations (and thus factor graphs) 
from /. 

In this paper, we extend these Monte Carlo methods to 
the case where / is real-valued (but not nonnegative) or 
complex. The motivation for this extension is twofold. First, 
the Fourier transform of a function / preserves the topology 
of the factor graph, but generally results in complex factors 
[6 1, [7]. Second, factor graphs of probability mass functions 
in quantum mechanics naturally involve complex functions 
||8). In both cases, computing quantities of the form ([T]i is 
of supreme interest. 

In full generality, the computation of ([T| is, of course, 
intractable (already in the nonnegative real case), but good 
Monte Carlo algorithms may nonetheless work well for many 
cases of interest (as in the nonnegative real case). 



notations in Section 
in Sections ITTT1 and 



IN the proposed algorithms are described 
IV] The description focuses on the real 
case; the generalization to the complex case is outlined in 
Section [V] The proposed methods are illustrated by some 



numerical experiments in Section VI 



II. Probabilities and Partial Partition Functions 
We begin the proposed generalization by defining 



and the probability mass function 



P\f\( x ) 



\f(x)\ 



Z 



l/l 



(3) 



(4) 



which will replace Q in the Monte Carlo algorithms. Note 
that pifi also inherits factorizations (and thus factor graphs) 
from /. 

In the following, we restrict ourselves to the case where / 
is real (but not nonnegative); the generalization to the complex 
case is indicated in Section IVl 

Let 



X+ = {x £ X 
X~ ={x£ X 



X° = {x £ X 



f(x) > 0} 
f{x) < 0} 
/(*) = 0} 



and thus 



\x\ = \x+\ + \x-\ + \x°\. 

We then define the partial partition functions 

^ = E /(*) 



x£X + 



ZV=J2 f{x) 



J f 



and thus 



Zf 



J \f\ 



Z 1 



(5) 
(6) 
(7) 

(8) 

(9) 
(10) 



(11) 
(12) 



Finally, we define probability mass functions on X + and 
X~ as follows: 



and 



Pf 



, +, n a /(a;) 
a;i->pj(x) = 



(13) 



x^pjix)^^ (14) 



A simple, but key, insight is that sampling from p^ or from 
pj can be reduced to sampling from p^: since coincides 
with p^ on A" + (up to a scale factor), and with pj on 
(up to a scale factor), samples x^',x^ 2 \ . . . from can be 
partitioned into samples from and from pj according to 
the sign of f(x^'). For example, samples from pifi, and thus 
both from p^ and from pT, may be drawn using tree-based 
Gibbs sampling as in g), (5}, |(9). 

In this paper, we will now focus on the computa- 
tion/estimation of the partial partition functions Z~j and ZJ 
separately. If these estimates are sufficiently accurate, Z$ can 
then be computed from ( [TT] ). This approach is bound to fail, 
of course, if both Z^ and \Zj\ are large and their difference 
is small. However, this cancellation problem (which is well- 
known in quantum mechanics fTU) ) is beyond the scope of the 
present paper. 



III. 



Estimating Z^J and Z f 



We will now propose two different Monte Carlo methods to 
estimate the partial partition functions Z^J and ZJ . The first 
method uses uniform sampling and the second method uses 
samples from p~j(x) and pj (x). Both methods need the value 
of \X + \ and \X~\, the computation of which is addressed in 
Section HV] 



A. Uniform Sampling 

1) Draw samples x^^x^,. 



from X + , and samples 
formly from X~ . 
2) Compute 



X 



(AO 



,x 



\x+\« 

K ^ 

k = l 



fix 



(fc)l 



uniformly 
, a;( L ) uni- 



(15) 



\X~ 



L 



fix 



(16) 



It is easily verified that E[Z+] = Z+ and E[Z~] = ZJ. 

One way to draw samples uniformly from X + and/or X~ 
is by drawing samples , uniformly from X and 

partitioning them according to the sign of f(x^ 1 '). 

B. Ogata-Tanemura Method ^11^ , / [72y 



1) Draw samples x 



(1) jc(2) 



from 



X + according to pj{x), as in ( joj , and samples 
2,(1)^(2) ^ a;( £ ) , . . . , x( L ) from A" - according to 
pj{x), as in jl4|). 



2) Compute 



— v— 

K\X + \t[fix^) 



LX 



It is easy to prove (see Appendix A) that E[f + ] 

E[f"] = 



(17) 



(18) 



□ 

it and 



IV. Estimating \X+\, \X~\, and \X°\ 

Again, we propose two different methods, one for uniform 
sampling and another for sampling from p\f\. In each case, 
the same samples as in Section [Til] can be used. 



A. Uniform Sampling 

1) Draw samples x^^x^,. 
from X. 

2) Compute 

I.V 



,(*0 



K 



r = 



K 

\X\ 
K 

\X 



k=l 
K 



£[/(**) <o] 



k=l 
K 



e =^}2[fix k )=o} 



k=l 



uniformly 

(19) 

(20) 

(21) 
□ 



In these equations, [•] denotes the Iverson bracket fl3{ p. 
24], which evaluates to one if the condition in the bracket 
is satisfied and to zero otherwise. It is easy to prove that 

EK+] = \X+\, E[£-] = \X~\, and E[£°] = \X% 



B. Sampling from p\f\ 
We assume |^f°| = 0. 
1) Draw samples iW, x^ 2 \ . 
cording to p\f\, as in Q. 



S k \ . . . , x( K ) from X ac- 



2) Compute 



■ _ 1 f 1 

K^fixW) 

1 K 1 

^^1/(^)1 



(22) 



(23) 



□ 

It is not hard to prove (see Appendix B) that E[A] = 
\ x+ \-\*-\ and E[f] = HL. 

Using d8l, we can then obtain estimates of \X + \ and \X \ 
from 



|Af+| - \X 



x-\ = \x\ 

A, 



1*1 



(24) 
(25) 
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Fig. 1. Estimated log 2 (Z^) vs. the number of samples, for K = 10 5 , 
TV = 6 X 6, and with factors as in \21\ . The plot shows 10 sample paths each 
computed with estimator \15\ . 



V. Extension to the Complex Case 

In (|5]l-([T0|, we partitioned X and Zf according to the sign 
of f(x). In the complex case, we allow more such bins, one 
for each possible argument (phase) of f(x), as illustrated by 



the example in Section VI-B The algorithms of Sections III 



and IV are easily generalized to this setting. 

However, the computation of probabilities in factor graphs 
for quantum probabilities as in [8), can actually be reduced 
to the real case as in Sections [II] and [HI] (as will be detailed 
elsewhere). 

VI. Numerical Experiments 

In our numerical experiments, we consider two-dimensional 
factor graphs of size N — m x m, with binary variables 
xi,x 2 , ■ ■ ■ , x N , i.e., X 1 = X 2 = ■ . ■ = X N = {0, 1}. 

We suppose / : {0, 1}^ — > C, and / factors into 



/(xi, . . .,x N ) 



k, I adjacent 



(26) 



where the product runs over all adjacent pairs (k,£). 

The corresponding Forney factor graph with factors as 
in (26 1 is shown in Fig. [7] where the boxes labeled "=" are 
equality constraints |TJ. 

A. Two-Dimensional Model with Negative Factors 
Let us consider a factor graph with factors as 



1.3, if x k — x e — 
K(Xk,Xi) = ^ 1, ifx k =xi = l 
— 1, otherwise 



(27) 



For this particular case, we prove in Appendix C that \X 
and \X~\ are analytically available as 



\x- 



\x- 



vZV-1 



(28) 



We estim ate using uniform sampling with estimator ( 15 1 
of Sectio n | III-A and the Oagata-Tanemura method with 
estimator of Section |III-B| 




1000 10000 
Number of Samples 



Fig. 2. Same conditions as in Fig. [I] but with K = 10 7 and N = 14 X 14. 



1000 10000 
Number of Samples 



Fig. 3. Estimated i log 2 (2^) vs. the number of samples, for K = 10 7 , 
N = 14 X 14, and with factors as in \21) . The plot shows 10 sample paths 
each computed with estimator 



Some experimental results are shown in Figs. [T] through |3] 
All figures refer to / with factors as in ( 27 1, and show the 
quantity \og 2 {Z^) vs. the number of samples K. 

Figs. [T] and [2] show simulation results using uniform sam- 
pling and Fig. [3] using the Ogata-Tanemura method. 

For N — 6x6, the estimated log 2 (Z^) is about 1.18, and 
for N — 14 x 14, it is about 1.23. As discussed in Section [ill] 
jj log 2 (|i?7|) can be computed analogously. 



B. Two-Dimensional Model with Complex Factors 

We extend our numerical experiments to factor graphs with 
complex factors as 



K,(x k ,Xl) 



1.5, 
1, 



if x k = Xf — 
if x k = X( = 1 
otherwise 



(29) 



where i is the unit imaginary number. 
We define 



X^ = {x e X : f(x) € iR >0 } 
x { -^ = {x e X : f(x) e -«>o} 



(30) 
(31) 
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Fig. 4. Estimated log 2 (|A'+|) vs. the number of samples, for K 



N = 15 X 15, and with factors as in 
each computed with estimator (19) . 



10 5 , 

[29). The plot shows 10 sample paths 



Fig. 5. Estimated ^ log 2 (.Z^) vs. the number of samples, for K = 10 6 , 
N = 6 X 6, and with factors as in {29). The plot shows 10 sample paths each 
computed with estimator (15) . 



I : x > 0}. 

\x-\, \X&\ 



where M >0 — {x E 
To estimate \X + \. 
uniform sampling of Section |IV-A| by first drawing samples 



and \X( we can apply 



„(!) ,,,(2) 



uniformly from X, and then using the samples 
in the relevant estimators according to the value of f(x( k >), 
e.g., in (M9p if f(xW) is a real positive number. 



For a factor graph of size N = 15 x 15, Fig. |4] shows the 
estimated log 2 (|A' + |) vs. the number of samples K. We obtain 
log a (|#+|) « 223. 

We again apply uniform sampling to estimate Zf, see 
Section [III- A Some experimental results are shown in Figs. [5] 
and [6] All figures refer to / with factors as in ( 29 1, and show 
the quantity jj log 2 (Z^) vs. the number of samples K. 

In Fig.|5j we have N — 6 x 6 and the estimated log 2 (Zy~) 
is about 1.26. In Fig. |6j the estimated -^log 2 (^!~) is about 
1.38 for a factor graph of size N = 15 x 15. 

VII. Conclusion 

We have shown that Monte Carlo methods as in |(5) can 
be extended to estimate the partition function of factor graphs 
with negative and complex factors. However, the cancellation 
problem of partial partition functions as in 
addressed. 



1 1 1 has not been 



Appendix A 



Suppose samples x^ >,x 



(i) x ( 2 ) 



»C*0 are drawn from X^ 



according to p~j(x), as in jl3ji. We have 

K 



E[r^ 



i 



r l 



K\X+\£-if(X<») 
K\X+\ ^ ^ fix) 

1 1 k=l x€X+ J V ' 



K 



\x+\ 



—y 

1 



(32) 

(33) 

(34) 
(35) 
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Number of Samples 

Fig. 6. Estimated i log 2 (Z^) vs. the number of samples, for K = 10 s , 
TV = 15 X 15, and with factors as in (29) . The plot shows 10 sample paths 
each computed with estimator (17) . 



□ 

The proof of E[r _ l = follows along the same lines. 
Appendix B 

Suppose samples a^ 1 ', x^ 2 ', . . . , x^ K > are drawn from X 
according to p\f\, as in Q. We have 



K 



ti L /(* (fc) )- 

p\M x ) 



k E E /!n 



%i ttx f{x) 



\x+\ - \x- 



J \S\ 



(36) 

(37) 
(38) 

(39) 

□ 



The proof of E[T] 



1*1 



follows along the same lines. 



Appendix C 



We consider a two-dimensional factor graph of size 
N = m x m, where m is finite and m > 2, with factors 



n(x k ,x e ) 



a, if Xk = X( 
—a, otherwise 



(40) 



where net and a^O. 

We use the normal factor graph duality theorem [6| to show 
that for this choice of factors, Zf, as defined in ([T}, is zero. 

Consider the dual of the Forney factor graph with factors as 
in (|40li. In the dual graph, the equality constraints are replaced 



by XOR factors, and each factor (40 1 by its two-dimensional 



Fourier transform which has the following form 



4a, if LJk = toe 
0, otherwise 



1 



(41) 



The corresponding Forney factor graph of the dual graph is 
shown in Fig. [8] where the unlabeled boxes represent factors 
as in ( |4"Tj ). 

Let us denote the partition function of the dual graph by Zd- 
Note that, each factor i/(u>k,<^i) is non-zero if u) k = tog = 1. 
Therefore, only the all-ones pattern might have a non-zero 
contribution to Zd- But this pattern does not satisfy the XOR 
factors of degree three in the dual graph, therefore Zd = 0. 
Using the normal factor graph duality theorem (6] Theorem 2], 
Q, we conclude that Zf — 0. Therefore, using 
we obtain 



11 



and ([12}, 



Z 



J f 



Putting a = 1 (or a = 
and hence the following 



l/l 

■ "t ~ 2 
-1), we have 

-ZJ = 2 N ~ l 



z+ = \x+\ 
-zj = \x-\ 



Thus 



\x~ 



yiV-1 



(42) 
(43) 

(44) 
(45) 

(46) 

Note that for m = 2, we have \X + \ = \X\, Z^ = Z f , and 
ZJ = 0. Finally, note that, we can still show Zf = if a € C. 
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